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Abstract 

In the paper we propose some new class of functions which is used to construct tail 
index estimators. Functions from this new class is non-monotone in general, but presents 
a product of two monotone functions: the power function and the logarithmic function, 
which plays essential role in the classical Hill estimator. Introduced new estimators have 
better asymptotic performance comparing with the Hill estimator and other popular 
estimators over all range of the parameters present in the second order regular variation 
condition. Asymptotic normality of the introduced estimators is proved, and comparison 
(using asymptotic mean square error) with other estimators of the tail index is provided. 
Some preliminary simulation results are presented. 
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1 Introduction and formulation of results 


From the first papers of Hill and Pickands (see IIS] , [S]), devoted to the estimation of the 
tail index (or, more generally, extreme value index), most of statistics constructed for this 
aim were based on order statistics and logarithmic function. Suppose we have a sample 
Xi,X 2 ,... ,Xn, considered as independent, identically distributed (i.i.d.) random variables 
with a distribution function d.f. F satisfying the following relation for large x: 

F(x} :=1-F(x} = x-^/'^L(x}. (1.1) 


Parameter 7 is called extreme value index (EVI) and a := I /7 > 0 is called the tail index, 
L{x) > 0, for all x > 0, and L is a slowly varying at infinity function: 

L{tx) 


lim 

a;->-cxD L{x) 


= 1 . 


In the paper we consider the case 7 > 0 only. Denote U{t) = F*^ (1 — (l/t))'*~, t > 1, where 
M is the left continuous inverse function of a monotone function IF, defined by 
the relation W^{t) = sup{x : W{x) < t}, t E /. It is well-known that in the case 7 > 0 
assumption (HI) is equivalent to the following one: for all x > 0 , 

U{tx) 


lim 


( 1 . 2 ) 


t -^00 U{t) ’ 

i.e., the quantile function U{t) varies regularly with index 7 . Let < • • • < ^n,n 

denote the order statistic of Xi,, X^. Taking some part of the largest values from the 
sample and the logarithmic function a statistician can form various statistics. In this way 
one can get Hill and Pickands estimators, moment and moment ratio estimators which are 
well-known and deeply investigated. The heuristic behind this approach (the threshold- 
over-peaks (TOP) phenomenon and maximum likelihood) is also given in many papers and 
monographs, therefore we do not provide it here. 

There were estimators, based on a little bit different idea: the sample is divided into 
blocks and in each block the ratio of two largest values is taken. Then the linear function 
/(x) = X instead of logarithmic one is applied to these ratios. Estimators, based on this idea 
were constructed in m and [T 8 |. The next step was to include the linear and logarithmic 
functions into some parametric family of functions, and, considering estimators based on 
block scheme, this was done in [19], taking the family of functions, defined for x > 1 

i (x'’ - 1 ), r < a, r / 0 , 

Inx, r = 0 . 


fr{x) = 


(1.3) 


In po] this family of functions was applied for order statistic. This was done by introducing 
the statistics 


1 


fc-i 


i=0 


Xn—i,n 


X, 


n—k,n 


£ = 1 , 2 , 


2 





and some combinations, formed from these statistics. Here k is some number satisfying 
1 < A; < n, and in the EVI estimation k is chosen as a function of n (thus, strictly speaking 
we should denote it by k{n)). In this way the generalizations of the Hill, the moment, and 
the moment ratio estimators (we shall write the expressions of these estimators a little bit 
later; to denote these estimators we shall use the letters H^M,MR and for generalizations 
we add the letter G) were obtained, for example, the generalized Hill estimator (GHE) is 
defined as 




Hn{k,r) 


1 + r ■ Hn{k,r) 


(1.4) 


(H) (CM^ 

and the Hill estimator is obtained by taking r = 0, 7 n (k) := 7 n {k,0). 

The main goal of this paper is to introduce another parametric family of functions, which 
has the same property that includes logarithmic function, and to construct new estimators 
using this family. For x > 1 let us consider functions 


gr,u{^) = 2 :’'ln“(x), 


where parameters r and u can be arbitrary real numbers, but for our purposes, connected 
with consistency, we shall require yr < 1 and u > —1. Moreover, mainly we shall consider 
only integer values of parameter u. The family {gr,u}, similarly to {/r}, contains logarithmic 
function (with r = 0 ), but, contrary to {/r}, contains logarithmic function for any value 
of parameter r (if tt 7 ^ 0 ). Also let us note that for r > 0 function is monotone for all 
values of u, while for r < 0 and u > 0 there is no monotonicity. 

Using these functions we can form statistics, similar to Hn\k,r): 


k-l 


Gnik,r,u) = T'^gr,z 


i=0 


^n—i,n 


X 


n—k,n 


The above mentioned Hill estimator, moment estimator (introduced in [ 6 ]) and the moment 
ratio estimator (introduced in [3]) can be expressed via statistics Gn{k,0,u), u = 1,2 as 
follows 


Inik) 


Gfiik, 0 , 1 ), 

Gn{k, 0 , 1 ) + - 11 

Gnik,0,2) 

2Gn{k,0,l)' 


f Gn{k,0,2) \ 

\GUkAl) ) 


Let us note, that, due to the expressions of functions fr and gr,u, we can express statistic 
Hn{k,r) via statistics Gn{k,r,u): 


Hn{k,r) 


{Gn{k,r,0) - 1)/r, r 7 ^ 0, 
Gn{k,0,1), r = 0. 


(1.5) 
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and there is continuity with respect to r in this relation, since it is easy to see that 

/Q\ 

lirur-j-o r, 0) — 1)/r = G„(A:,0,1). Taking into account that H\’{k^r) can be ex¬ 

pressed via Hn{k,r) (see (3.2) in [20]), all estimators, which were introduced in [2D|, can be 
written by means of statistics Gn{k,r,u) only. 

One more estimator of the parameter 7 , which can be written as a function of statistics 
Hn \k,r), therefore also by statistics Gn{k,r,u), very recently was introduced in [T] (see 
also m and [ 22 ]). It is named as the harmonic moment tail index estimator (HME, this 
abbreviation is used in [ 5 ), and for /3 > 0 , /3 7 ^ 1 , it is defined by formula 


In 


(fc,/3) 


1 



( 1 . 6 ) 


while for /3 = 1 it is defined as a limit as /3 —)• 1. It is easy to see that 


^^^ik,l)=k-^J2^n 

i=0 


^n—k,n 

^n—i,n 


i.e., we get the Hill estimator. But very simple transformation shows that, denoting /3 = 
1 — r, we have 


7 "“(A;,l-r) = 


1 - (r • H^\k,r) + 1) ^ 


Hk^\k,r) 


= 7n^(fc,r). 


r • Hi^\k,r) -t- 1 


This means, that the HME is exactly the GHE, only additional (tuning) parameters /3 (for 
HME) and r (for GHE) have different ranges. It is strange why there is requirement /3 > 0 
in the definition of HME, which corresponds to the condition r < 1 for GHE. Of course, 
both statistics can be defined for all real (3 and r, but in order to have consistent estimators 
of the parameter 7 we must have some restrictions on these tuning parameters. In [20| the 
consistency of GHE is proved under condition yr < 1 (or r < a), which is equivalent to 
condition /3 > 1 — 7 “^, while in [T] the consistency of HME is stated (there is only the sketch 
of the proof) without any additional condition on /3, so one can think that HME is consistent 
for all /3 > 0. In [20| it was explained why the GHE is not consistent if r > a, and from the 
expression (m intuitively it is clear that for /3 < 1 — 7 ^ we have — /3) 

This discussion even shows that the requirement /3 > 0 is not natural: for 0 < 7 < 1 we 
have 1 — 7 “^ < 0, thus for 1 — 7 “^ < /3 < 0 we still have consistency of HME, while if 7 > 1 , 
then in the interval 0</3<l — 7 “^ there is no consistency. The asymptotic normality 
of HME in [T] is proved under the requirement /? > 1 — ( 27 )“^, which is equivalent to the 
requirement 2'yr < 1, used in |20j . 

Our first result shows what quantities are estimated by the introduced statistics Gn{k, r, u). 
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Theorem 1.1 Suppose that Xi,..., are i.i.d. nonnegative random variables whose 
quantile function U satisfies condition ([h. Let 7 r < 1 and u > —1. Let us suppose 
that a sequence k = k{n) satisfies conditions 

fc(n) —oo, n/A:(n) —oo, as n —>■ oo. (1-7) 


Then we have 


Gnik,r, u)^i{r, u) 


7^r(l + n) 

(1 — 7r)i+“ ’ 


as n ^ oo. 


( 1 . 8 ) 


Here stands for the convergence in probability and r(u) denotes Euler’s gamma function. 


The following corollary allows to proof the consistency of estimators, expressed as a function 
of statistics Gn{k,r,u) with different r and u. 

Corollary 1.1 Suppose that Xi, ..., Xn are i.i.d. nonnegative random variables whose 
quantile function U satisfies condition lll.S\) . Let 'yrj < 1 and Uj > —1, j = 1,2,... ,i. Let 
us suppose that a sequence k = k{n) satisfies Let x(ti ,... ,ti) : (0, oo)^ —>■ (0, oo) be 

a continuous function. Then 

p 

X {Gn{k, ri,ui),..., Gn{k, r^, U()) -^x (^(n, ^i), • • • ug)), (1.9) 


as n ^ oo. 


The relation p.9p gives us many possibilities to form consistent estimators of 7 using 
statistics Gn{k,r,u) with different r,u. Since it is not clear which combinations are good 
ones, we decided to restrict ourselves with two most simple statistics Gn{k,r,Q), Gn{k,r, 1) 
(that is, n = 0 and u = l)and to consider the following three estimators of 7 > 0 


ln\k,r) 

ln\k,r) 

ln\k,r) 


I (G„(/c,r,0) - l)(rG„(fe,r, 0))“^ r / 0, 

\ Gn{k,Q, 1), r = 0, 

_ 2Gn{k,r, 1)_ 

2rGn{k, r, 1 ) + 1 + ^/ArGn{k,r, 1) + l’ 
f {rGn{k,r,l) - Gn{k,r,0) + l){r‘^Gnik,r,l))~^, r / 0, 
1 ln^{k), r = 0. 


( 1 . 10 ) 


One can note, that the estimator fin\k, r) is exactly the GHE, given in ()1.4p . only expressed 
via statistics Gn{k,r,u). For us it will be convenient to use this notation for GHE, since 
we shall compare these two new constructed estimators not with the Hill estimator (which 
earlier was like a ’’benchmark” for other estimators), but with the GHE fin\k,r). Since 
fin\k,ll) = 7 i^\fe, 0 ), the second estimator presents another generalization of the Hill 
estimator, while the third estimator gives us the generalized moment ratio estimator. 

The main step in proving asymptotic normality of the introduced estimators 7 ^ ^ {k, r),j = 
1,2,3, is to prove asymptotic normality for statistics Gn{k,r,Hj, Gn{k,r,l). As usual, in 
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order to get asymptotic normality for estimators the so-called second-order regular variation 
(SORV) condition, in one or another form, is assumed. In this paper we shall use the SORV 
condition formulated by means of the function U. We assume that there exists a measurable 
function A{t) with the constant sign near infinity, which is not identically zero, and A{t) ^ 0 
as t ^ oo, such that 


U (tx) ^ 

-rmv — X’ tP — I 

lim = X^- -^ 

i—>-00 A(t) p 


( 1 . 11 ) 


for all X > 0. Here p < 0 is the so-called second order parameter. It is known that (|l.lip 
implies that the function |j4(t)| varies regularly with index p. 

Let us denote dr{k) = 1 — fcyr. 


Theorem 1.2 Suppose that Xi,..., Xn are i.i.d. nonnegative random variables whose 
quantile function U satisfies eondition Suppose that yr < 1/2 and that the sequenee 

k = k{n) satisfies ra and 

lim \/kA (y') = pG (—oo,-|-oo). (1-12) 

n —>-00 \k / 


Then, as n ^ oo, 


'/k{Gn{k,r,0) - ^{r,0),Gnik,r,l) -f,{r,l))Ap + , (1.13) 


where stands for the convergence in distribution and quantities j = 1,2 are as 

follows 

1 — p — 


uW{r) = 




dr{l){dr{l) - pY " {dr{l)Y{dr{l) - pY' 

Here (VL^^^IR^^^) is zero mean Gaussian random vector with variances E = s^{r), 

j = 1,2 and covariance E = si 2 (r), where 


Si r = 


7"^“ 2/ \ 7^(dr.(2)-I-27^r'^) 7^r(dr(l) — y^?'^) 


.,2.^2 


-1 sfir) = 


, si2(,rj = 


dr{2)df{iy ' rfr(2)4(l) 

From Theorem O we derive the main result of the paper 


dl{2)dl{l) 


(1.15) 


Theorem 1.3 Under assumptions of the Theorem MSA 


where 


\fk {fiY\k,r) - 7 ^ Am {pvj{r),a‘j{r)) , j = 1,2,3, 


vfir) 


p'2{r) 




driY 2f N ^ 7^d^(l) 

dr(l)-p’ ’ dr{2) ’ 

d,(l)(l - p - 7^2) 2. ^ 7"^^.'(1)K(2) + 

(l + 7r)(d,(l)-p)2’ (l + ^r)2d3(2) 

dKY 2f 1 ^ 27 ^d/(l) 

(dr(l)-pY’ d3(2) • 


(1.16) 
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Having asymptotic normality of the introduced estimators in Section 2 we compare the 
asymptotic mean square error (AMSE) of these estimators. In [20] we showed that the GHE 
jn\k, r) dominates the Hill estimator in all region of the parameters 7 and p: 7 > 0, p < 0. 
Although this domination is rather small (see Fig. 2, the right graph ), but theoretically 
it is important, since, as far as we know, this is the first estimator, which asymptotically 
performs better than the Hill estimator in all region of possible range of two parameters 
7 and p. Now we compare estimators ^n\k,r), j = 2,3, with ^n\k,r), and the results 
are promising. First of all, GMRE substantially outperforms MRE in all region of the 
parameters 7 > 0, p < 0, see Fig. 2, the left graph. Fig. 3, the right graph presents 
comparison of GMRE with GHE, and asymptotic result (solid line) shows that no one 
estimator dominates in all region —00 < p < 0 (the ratio of AMSE does not depend on 7), 
while simulation results (points) demonstrates the domination of GMRE for all values of p, 
for which the simulation was performed. 

Finally, we formulate some results concerning the robustness of the introduced estimators. 
We follow the paper [Tj, where robustness was considered for the HME (or in our notation, 
for ^^\k,r)). 

To define the robustness measure for the estimator ^n\k,r), instead of ^n\k,r) we will 
use the notation 7^ (fc, r; Ai,..., A„) . Let us define 

^lnHk,r,x) = j^'>{k,r;Xi,.. .,Xn-i,x) - - l,r; Ai,..., A„_i). 


Then, following [T], for hxed n and k we take a quantity 

B^\k,r) = lim Aj^\k,r,x) 

X^OO 

which measures the worst effect of one arbitrarily large contamination on the estimator 
^n\k,r). For fixed n and k these quantities are random variables, but it turns out that 
asymptotically they become constants, depending on 7 and r (here it is appropriate to note, 
that results on robustness are based on Theorem 0 thus there is no dependence on p). 

Let us denote by ;= B^^\'y,r) the limit in probability of Bn\k,r), I = 1,2,3, as 
n —>■ 00 and (11.711 holds. 


Theorem 1.4 Suppose that Ai,...,A„ are i.i.d. nonnegative random variables whose 
quantile function U satisfies condition MdXi . Let yr < 1. Then we have 


bU) 


0, r < 0, 

< 00 , r = 0 , 

(1 — 7 r)/r, 0 < r < 1 / 7 . 


J = 1,2,3. 


Assuming the SORV condition (11.111) we were able to find optimal values of r for 7 ^ ^ (/c, r), 
j = 1,3 (see formulas (|2.7p and (|2.8p in Section 2), therefore, for the generalized Hill 
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estimator we get r|) = 7(1 — p + . For the generalized moment ratio 

estimator the situation is even better, since optimal value < 0 , therefore = 0 , 

while 0 ) = 00 . At first it seemed for us little bit strange, that for all three estimators 

we get the same value of B^^\ but looking more carefully to the construction of this measure 
of robustness, we realized that it is quite natural and even the proof is almost trivial. Since 
the second term in the expression of A^n\k,r,x) is independent of x, moreover, it tends 
to 7 , we have 

= lim r; Ai,..., X„_i, x) - - 1, r; Ai,..., A„_i). (1.17) 

x—>oo 

Thus, robustness of the given estimator depends essentially on this first limit, which can be 
zero, infinity of finite, depending on the term {xjXn-k,n)- For all classical estimators 

tends to infinity, therefore we get infinite value for robustness, 
while for all three generalizations we are getting that this limit asx—^-ooisl/r, ifr >0 and 
is 7 , if r < 0, thus we are getting result of Theorem 1 1.41 Moreover, the proof of this theorem 
shows that we can contaminate the sample not by one large value, but by several, and the 
asymptotic result will be the same - generalized estimators will remain (asymptotically) 
robust. 

The rest of the paper is organized as follows. In the next section we investigate asymp¬ 
totic mean square error of the introduced estimators, and compare these estimators with 
the generalized Hill estimator, using the same methodology as in [T^j, and provide some 
simulation results. Then there are formulated conclusions, and the last section is devoted to 
the proofs of the results. At the end of the proof of Theorem 11.31 we discuss the alternative 
proof of this result based on the paper [7] . 

2 Theoretical Comparison of the Estimators and Monte-Carlo 
simulations 

As in [ 12 ] we can write the following relation for the asymptotic mean squared error of the 
estimator 7 ^^^ (k, r) 

AMSE(#(fc,r)) ~z.|(r)A2(^)+^, £ = 1,2,3, (2.1) 

where a sequence k = k{n) satisfies (I1.12p . Here and below we write a„ ~ bn if a-n/bn —^ 

1 as n ^ 00 . As in |2D], assuming that r is fixed, we perform the minimization of 
AMSE (^^n\k,r)^ with respect to k . We will obtain optimal value of k{n) for the es¬ 
timator 'yn\k,r), which will be denoted by kg{r). Then we minimize asymptotic mean 
squared error AMSE ^ 7 ^^^ (fc|(r),r)^ with respect to r. 

Thus, let us assume that r is fixed. We define the function a by the following relation: 

/ OO 

a{u) du, t ^ OO. (2-2) 
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By applying Lemma 2.8 in [5] we get that minimum of right hand side of (12.111 is achieved 
with 


kKr) = 


cr: 




l/(l-2p) 


n 


vj{r) J o'^(l/re)' 

Following the lines in m and using assumption (11.121) . we get 


k}{r)A^ 


n 


M(r)J i-2p)u‘^(r)' 
Substituting (|2.3p and (12.41) into (|2.ip we obtain 


n 


oo. 


(2.3) 


(2.4) 


AMSE (# mr),r)) ~ (r/|(r) {aj{r)) (2.5) 

Now we minimize AMSE (^|(^)5^)^ (more precisely, the right-hand side of ()2.5p l with 
respect to r, and for this it is sufficient to minimize the product r'l(r) ((T^(r)) with respect 
to r. Let us note that asymptotic parameters viir), 1 < .^ < 3 depend on parameters p and 
yr, while quantities af{r)j^‘^, 1 < i < 3 depend on the product yr only. Therefore, it is 
convenient to introduce notation R = 'jr and to consider minimization of the function 


ve{R) = {(y\{Rh)) , 


with respect to R satisfying inequality R < 1/2. Equating the derivative of this function to 
zero, we get 


crj{Rh) 


dvi>{R/l) 

dR 


pvi{Rli) 


dnl{Rli) 

dR 


= 0 . 


( 2 . 6 ) 


By substituting the values of and a\{Rl^^ into equation (12.6p we get the equation 

— R(2 — /?) -|- p = 0. Whence it follows that the optimal value of R for the estimator 
4n m{r),r) is _ 


(2 - p) - V(2 - p)2 - 4p 

r> 5 


(2.7) 


since the second root of the quadratic equation does not satishes the relation i? < 1/2. As 
for the estimator (A:jj‘(r), r), the optimal value of the parameter r was found in [20) . and 
in our notation (it is necessary to note, that SORV condition in [20) was used with different 
parametrization, see (1.3) therein) the optimal value of R is 


Rl = —— --. ( 2 . 8 ) 

Unfortunately, the situation with the estimator 7^ (A;2(r),r) is more complicated. By 
substituting the expressions of i' 2 {Rj'y) and a'^iR/'j) into ()2.6|) we get the equation 


2R^ - 2i?®(l - p) - 2i?^(5 - 3p) + 2ii®(p2 - 3p + 6) - 2pR^{5 - 2p) - 6R^(1 - pf 
+R^{^p^ - 22p + 15) - 2R^{bp^ - Up 9) AR{p‘^ - 3p -h 2) - (1 - p) = 0. 
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For a fixed given value of p this equation of 9-th order was solved with ’’Wolfram Mathe- 
matica 6.0”. It turns out that depending on the parameter p it has 3 real roots and three 
pairs of conjugate roots or 5 real roots and two pairs of conjugate roots. All real roots were 
substituted into the function rj 2 {R) and optimal value was found in this way. Numerical 
values of optimal value i?2 as a function of p are provided in Fig.l. Although we got ex¬ 
plicit and very simple expressions for the optimal values Rl and R^, we provide these two 
functions (as functions of p) in the same Fig.l. 



Figure 1: Graph of the functions R\{p) (solid line), R^ip) (dashed line) and R^{p) (dotted 
line), —10 < p < 0 


Let = R}I'J denote optimal value of the parameter r. From this picture we see that 
the first two functions R*, i = 1,2 has comparatively small range of values (for R^ip) the 
range is (—1,0) ), this means that optimal value of parameters r* and are not sensitive 
to the parameter p, but more sensitive to 7. 

Now we are able compare the generalized Hill estimator (/cj|‘(rj‘),r|) with the an- 
other generalization of the Hill estimator 7n (fc^rl), r^) and generalized moment ratio 
estimator 7)1 ’ (kail's)^ before performing comparison of these estimators, at first 

we demonstrate that generalized moment ratio estimator {k^{r^),r^) outperforms the 

/oX 

initial moment ratio estimator 7^ (/c3(0),0) in the whole area {(7, p) : 7 > 0, p < 0}. 
Denoting 


V’mk(p) 


AMSE (A:|(0),0)) 
AMSE (# (A:|(r3 *),r 3 *))’ 


it is not difficult to get that 


V’M/?(p) 


( 2 {v{p) - p)^ (u(p) - l + p) 

[ (l-p)4(^(^) + p)4-8p 



l/(l-2p) 
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where v{p) = ((2 — — 4/?)^'^^. Since we must investigate this function on negative half- 

line {yO < 0}, it is convenient to denote —p = x and to write 

^mr{x) = ^mr{-x) = 

with 

2“®® (h(x) + (■u(x) - 1 - x)®"" 1 

=- (l+x)-(^W-.r“ -’ 

where -u(x) = v{—x). Taking logarithm of iPmr{x), using the fact that / is product of 
several elementary functions, and using the simple relation v{x) — x = 4 + 0{x~^), one can 
get 

~ 27 

lim lmpMR{x) = 31n3 — 41n2, or lim iPmr{x) = — = 1.6875 

x—>-oo x—>-oo 16 

In a similar way one can get 

lim In-tpMRix) = 0, or lim -^MRix) = 1. 

X —>0 X —>0 

As a matter of fact, ipMRiO) = 1, but considering asymptotic normality and AMSE of 
estimators under consideration we excluded the case p = 0, therefore we calculate this last 
limit. More difficult is to show that the function ^mr{x) is monotone and increasing (or 
i’MRip) is decreasing), we skip these considerations, only we mention that we use the fact 
that the logarithmic derivative of a product of functions is a sum of logarithmic derivatives 
of these functions. The graph of the function p < 0 is provided in Fig.2 in left. 

In the same picture we gave also results (in form of separate points) of simulations, which 
will be explained below. Surprisingly, simulation results are even better than theoretical 
asymptotical result - most points are above the graph of '4^mr{p)- 




Figure 2; Graph of the functions V'mr(p) (solid line, in left), ipHip) (solid line, in right) 
and results of Monte-Carlo simulations (points) 

Now we can compare this result with the analogous comparison of the generalized Hill 
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estimator with the Hill estimator. If we denote 


AMSEIt^ (fet(O),^ , 

i’nip) = lim - 

™ AMSE {kt{rl),rl)) 


then we have 


'ipHip) = 


{1-Rl-p)'^{l-2Rl)-'^py^^^ 


(l-p)2(l-i?J)2-4p 

In [20] it was shown that the improvement of the generalized Hill estimator over the Hill 
estimator is important only theoretically since the maximal value of the function ijjH is 1.05, 
see Eig.2 in right (this is the same graph as in Eigure 2 in |20j . only now we added results of 
simulation; in this case simulation results are in good correspondence with theoretical ones). 
Thus, comparing with the generalized Hill estimator, we have substantial improvement of 
the moment ratio estimator, since in a big range of the parameter p the function ipMR is 
bigger than 1.3, and the maximal value is close to 1.7. 

Now we return to comparison of estimators (^^2(^2)) ^2) ^'^d ^ 3 ) with 

the generalized Hill estimator rjO, and we must investigate the following two 

functions: 

AMSE 


p^(p) := lim -——- 

™ AMSE (fc;(r;),r|)j 


= 2,3. 


It is important to note that both functions are independent of 7 and depend only on p. In 
view of (|2.5p we have 


'^i{p) = 


' (rt)) 


-2p \ l/{l-2p) 


£ = 2,3. 


Since we were able to obtain the optimal value of i?2 only numerically, we can provide only 
a numerically obtained graph of the function (p 2 {p), see Eig.3 on the left, therefore, in this 
case we did not provide simulation results. 

Although this graph allow to believe that the new generalization of the Hill estimator 
dominates the generalized Hill estimator (which is the same as HME) in all region of pa¬ 
rameters {(7,/?) : 7 > 0, p < 0} , but without explicit expression of the function p 2 {p) we 
can not prove this. 

Einally, comparing two generalizations of the Hill and moment ratio estimators (which, in 
our opinion are the most successful, since for both of them we have quite simple expression 
for optimal value of = 1, 3 )we have 


^3{p) = 


3-6p(u(p)-p)^-^P(^p) + (1-p))-2^ 

43-5p( 1 _ 2p)2 (y(p) + (1 _ p))-6P (^(^) _ p)4-4p 


l/{l-2p) 
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Figure 3: Graph of the functions ^P 2 {p) (on the left), ip^{p) (solid line, on the right) and 
results of Monte-Carlo simulations (points) 

where w{p) = ((2 — p)^ — 2) .As can be seen from Fig. 3 in right, generalized moment 
ratio estimator (^3(^3)) ^3) dominates the generalized Hill estimator 7^^^ (^i(^i))^i) for 
p G (p, 0), where p « —4.57018. It is not difficult to show that 

33 

lim P3(p) = 7^ = 0.84375. 

p^—oo 

And empirical results, as in the case of function 'iPmr{x) in Fig. 2 (in left), are somehow 
unexpected - almost all empirical points in the figure are not only above the graph of 
but they are bigger than 1, this means that the empirical MSE of generalized 
moment ratio estimator is smaller than the empirical MSE of the generalized Hill estimator 
for all values of p in the interval —10 < p < 0, not only for p G (p, 0), as gives asymptotic 
theoretical result. 

Now we shall provide some results of Monte-Carlo simulations (part of these results are 
given in Figures 2 and 3 together with theoretical results). We must admit that these results 
are very preliminary, since for practical application of the proposed estimators still we are 
facing with problems of estimating the parameters p and r, and the main problem is with 
estimation of p (and this problem is for almost all estimators, if we want to have asymptotic 
normality of the estimator), since optimal value of r is expressed as a function of p. We 
intend to return to this problem in a separate paper. 

For simulations we shall use a little bit more restrictive condition than (II.lip , namely, we 
assume that the distribution function F{x) under consideration belongs to the Hall’s class 
of Pareto type distributions 

1 - F{x) = (^) ® ) ’ a: ^ 00, (2.9) 

where C > 0, /3 G R \ {0} and p < 0. This assumption is assumed for the following reason. 
Taking the ratio of AMSE of two estimators we do not need to know the function a^, but 
for simulations, having given sample size n, we must calculate the value of k^{r) in (12.31) 
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and the empirical MSE of estimators, and for this we must have the function a'^. Assuming 
(EJD, we have that the second order condition (|l.lip holds with A{t) = and from 
it follows that a'^{t) = (—ii/( 2p-i)^ Now we can rewrite (|2.3p as follows: 


kKr,l3,p) = 




—2pl3‘^j^Up{r) 


l/(l-2p) 


n 


-2p/(l-2p) 


In fact, quantities ki{0, j3, p), i = 1, 3 depend on /3, p and n only, thus replacing (3 and p by 
some estimators [3n and pn-, we obtain the empirical values of the parameter k{n) for the 
Hill and the moment ratio estimators: 


k\^n 

^3,n 


Al-Pn)^ 

V 

- Pn)^ 

V -Pn^n 


l/(l-2p„) 

^~2p7T,/(1 —2p^) 


l/(l-2pn) 

^ —2pn/(1 —2pri) 


( 2 . 10 ) 

( 2 . 11 ) 


For corresponding generalized estimators we have additionally to take estimators of optimal 
parameter R, therefore we have 


ATgn = 


K.n = 


(1 Pn Ri {Pn))‘^ 

^~‘^PnPnO- ~ 2-^1 {pn)) ^ 

{I- Pn-RliM)^ 

-pJl{l-2Rl (p0)^ 


l/(l-2p„) 


n 


-2p„/(l-2p„) 


l/(l-2p„) 


n 


-2p„/(l-2p„) 


( 2 . 12 ) 

(2.13) 


Thus, in our simulations the comparison is made between the Hill estimator 7^^^ ( A:i^„,0 


:,( 1 ) 


Ki^n,rl (7!^^ 


ki,n, 0) , ) ), the moment ratio estima- 


:,(3) 




A3) 

Tn 


^3,ni 0 j , Pn 


the generalized Hill estimator 7n 

tor 7^^^ (j^ 3 ,n, 0^ and the generalized moment ratio estimator jp 
with parameters given in p2.10p -(2.13l. 

We generated 1000 times samples Ai,..., A„ of i.i.d. random variables of size n = 1000 
with the following two d. f. with the extreme value index 7, parameter p and satisfying 

(ESP: 

(i) the Burr d.f. F[x) = 1 — (l + x > 0; 

(ii) the Kumaraswamy generalized exponential d.f. F{x) = 1 — (l — exp 
X > 0. 

The parameter (3 which is present in (12.9p for the Burr distribution is 1 and for the 
Kumaraswamy distribution - 1/2, and (7=1 for both distributions. To calculate the Hill 
and the generalized Hill estimators we used the following algorithm: 

1. Estimate the parameter p by the following estimator proposed in [8]: 


Pn{k,T) = - 


(T^Alk\ - 




w-i) (Alt) 


-3 
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where 


-1 


Tic\k) = [{GnikMY - (G4fc,0,2)/2)"/2) ((G4A:,0,2)/2)"/2 _ (G„(fe, 0,3)/6)"/3 
with r > 0, and 

^(0).. ^ ^ In {GYK 0,1)) - (1/2) In (G„(fc, 0, 2)/2) 

" ^ ^ (1/2) In {Gn{k, 0,2)/2) - (1/3) In (G^fc, 0, 3)/6)' 

To decide which values of parameters r (0 or 1) and k to take in the above written estimator 
we realized the algorithm provided in m- 

2. To estimate the parameter /3 use the estimator /3„ {k, pn{k,T)), where 


f^n (k, p) = 


and Wi = i\n.[Xn+i_i^n/Xn+i^n)^ 1 < i < A; < n. This estimator was introduced in [TO] 
Also, as in the step 1 to choose the parameter k we applied algorithm from m- 

3. By using (l2lTl estimate parameter k for the Hill estimator and then obtain 0 

4. Estimate Rl (pn) and ^7^^^ (ki,n,0^ , Pn^] 

5. By using (|2.12|) estimate parameter k for the generalized Hill estimator and find 
estimate [Ki^nXi [in^ (^i,n,0) ,pn 


(b'{ 

(lEbi «)■') 

(i E‘.i tr,) 

-1 

IiEbi arY} 

(W! 

U(i)-') 

(ie: 

LarY 

1- 

(iEt. (i) ■'"»',) 


We used the similar algorithm (with obvious changes) for the moment ratio estimator 
In'^ (j^3,n,0^ and the generalized moment ratio estimator 7® 

Having values of the estimators we calculate MSE and bias of these estimators and also the 
ratios of some pairs of MSE. 

The results of simulations are summarized in Fig. 4 (for the Burr distribution) and in 
Fig. 5 (for the Kumaraswamy distribution). For the Burr distribution we took parameters 
7 and p in the intervals (0,4] and (—5,— 0.2], respectively. In Fig. 4 (left) we divided this 
rectangle (0,4] x (—5,—0.2] into squares Ajj = (i/10, (z + 1)/10] x (—(j + 1)/10, —j'/lO], 
i = 0,1,..., 40, j = 2,2,, , 50. By taking true values of parameters 7 and p as coordinates 
of the center of the rectangle Aj^-, we performed Monte-Carlo simulations. We colored the 
square Ajj in black if empirical MSE of the estimator 7^^^ ^7^^^ 

is the smallest among all four estimators under consideration, while areas of domination 
of the estimators 7n^ (^fei,n,0^, (^3,n,0^ and jP [k 2 ,,n,rl (^h,n,0^ ,Pn^^ are in 

dark grey, grey and in white, respectively. In Fig. 4 (right) there are given the areas of 
domination of absolute value of the bias using the same colors as in the left picture. 

We faced with serious difficulties while estimating the second order parameter p when 
samples Ai,..., A„ are simulated from Kumaraswamy distribution with p < —2 (for many 
samples computer was getting infinite value of sums involved in the estimator of p), thus 
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Figure 4: Empirical comparison of the estimators by using Burr distribution 



0123401234 


y y 


Figure 5: Empirical comparison of the estimators by using Kumaraswamy distribution 

we restrict our simulations in the rectangle (7, p) G (0,4] X [-2,-0 .2], The corresponding 
areas of domination of MSE and bias for this distribution are presented in Fig.5. 

Fig. 4 and 5 demonstrates that areas of domination almost do not depend on parameter 7 
and essentially depend only on p. This corresponds well to theoretical results that functions 
^lip), £ = 2, 3, ipnip), and ^I^mr{p) depend on p only. Therefore, taking the particular value 
7 = 1 and the hundred of values of p in the interval (—10,0) for the Burr distribution we 
obtained ratios of empirical MSE’s to complement theoretical comparison and included these 
ratios (as separate points) in Fig. 2 and 3. From Fig. 2 (right) we see that empirical results 
on comparison of MSE of the generalized Hill estimator with MSE of the Hill estimator 
reflects quite well theoretical (asymptotic) function ipnip) - points are on both sides of this 
function. Little bit unexpectedly, Fig. 2 (left) and 3 (right) reveals that empirical results 
differ from theoretical ones. In Fig. 2 (left), where generalized moment ratio estimator is 
compared with the moment ratio estimator, in the interval —5 < p < 0 empirical points are 
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very close to the theoretical function 'ipnip), but in the interval —10 < p < —5 all points are 
above the theoretical curve, this means that the moment ratio estimator has MSE almost 
two times bigger that MSE of the generalized moment ratio estimator. Empirical results 
in Eig. 3 (right) shows that the generalized moment ratio estimator performs better than 
GHE for all values of p in the interval —10 < p < 0, while theoretical result predict such 
result only for p € (p, 0). 

3 Conclusions 

We introduced a new parametric class of functions which allows to construct many new 
generalizations of well-known estimators, including such as the Hill, the moment, and the 
moment ratio estimators. We prove asymptotic normality of all these generalized estimators 
in a unified way and demonstrate that in the sense of AMSE new estimators are better 
than the classical ones, especially promising looks GMR estimator. We hope that this new 
parametric class of functions will be useful in the difficult problem of estimating the second 
order parameter p. 

Preliminary simulation results show quite good correspondence with the obtained theo¬ 
retical results, but we admit that future work on the behavior of new estimators for middle 
size samples is needed. 

4 Proofs 

Proof of the Theorem M . R There is nothing to prove in the trivial case r = u = 0. Keeping 
in mind relation conclusion (11.811 is the immediate consequence of the Theorem 1 in 

PU] in the case u = 0 and yr < 1. The case r = 0, u> 1 was investigated in [^. 

Consider the case r / 0 and u > —1, rt 7^ 0. Let us recall that the function U{t), t>\ 
varies regularly at infinity with the index 7, thus, by applying Potter’s bound we have: for 
arbitrary e > 0 there exits to; such that, for x > 1 and t > to, 

(d - oxy-< ^ < ai + e)xr+’. ( 4 , 1 ) 

In order to apply (|4.1I1 for the function P it is convenient to introduce the notation e±(r), 
where e±(r) = e, if r > 0, and e±(r) = —e, if r < 0. Then we get 

(^)’'<{(l + £±(r))U’'‘’+'*‘’'». (4.2) 

Similarly we get the following inequalities 

(7 - £±(«))" ln“ ((1 - e±(«))x) < ln“ <h + e±('‘))“ In” ((1 + t±{«))x ), (4,3) 
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By multiplying inequalities (14.21) and (I4.3p . we obtain 


cia.K7-^±M) ((1 


e±(M))x) < gr,u 



< ln“ ((1 + e±(u))x), 


where 


Cl = (7-c±(u)r(l-6±(r)r(^-^±W), 

C2 = (7 + e±Mr(l + e±(r)r(^+^±(")). 

Let Yi,... ,Yn he i.i.d. random variables with distribution function G{x) = 1 — (1/x), x > 1. 
Taking 

^ h^n—k,ni ^ 'h^n—i,n/YYi—k,n 


for z = 0,1,... , A; — 1 we get 


Cl 


C2 


Y 

^ n—i,n 

Yn—k,n 

Y 

^ n—i,r 


r{'y-e±{r)) 


lnM(l-6±(u)) 


Y 

^ n—i,n 

Yn—k,n 


Y Qr,! 


(U{Yn-i,n) 

\UiYn.k,n) 


Yr 


n—k,n 


r('y+f^±{r)) 


1 u i , / \\h^n-i,n\ fU{Yn-in)\ 


Note that U{Yi} = Xi, i = 1,2,... ,n, thus 


Gr 


n > 1 ^ (V{Yn-i,n) 


From this equality, by summing inequalities (I4.5p and (l4.6l) . we get 
k-l 


Cl 

~k ^ 


V \ »’(7-e±(?’)) / V 

^n—i^n \ 1 u / /I f \\ 

' In (1 - e±(u)) 


t ^ \Yn—k,n. 

2=0 ^ ^ 

k—l / tr \ r( 7 +e±(r)) 


C2 

T 2^ 


Y 

^ n—i,n 

^ ^ \Yn—k,n 
2=0 ^ ’ 


ln“ (1 + e±(u)) 


Yn—k,n 

Y 

^ n—z,n 

Yn—k,n 


< Gn{k,r,u) 


> Gn{k,r,u). 


(4.4) 

(4.5) 

(4.6) 


(4.7) 


(4.8) 

(4.9) 


By means of the standard argument (see e.q. 0 ) one can deduce that the left-hand side of 
(14. 8p equals (in distribution) to the sum 


2=1 


The expectation of this quantity equals to ci (Ai + A2), where 


Ai = 
A2 = 


/ OO 

x'’(^-"±(^))-2ln“(x)dx, 


ln(x) 


— 1 > dx. 
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One can verify that 


a—2 i_6/ 


ln®(x)dx = (1 — a) ^ ^r(l + 6), a < 1, 6 > —1. 


(4.10) 


By using the last identity, assumptions 1 — 7r>0, u>—1 and the fact that re± > 0 we 


get 


ciAi = 


(1 - e±(r))^(^-^±W )(7 - e±(u))“r(u + 1) 


(1 — r(7 — e±(r)))“+^ 

Whence we get ciAi —)■ ^(r, u), as e ^ 0. 

Consider a quantity A2 now. If 0 < u < 1, then e±{u) = e and we use the following 
inequality which holds for any real numbers a and b: ||a|“ — |6|“|<|a — 6|“. We have 


/ C30 

^r(7-e±(r))-2^^ 

|ln(l-e)r 


l-r( 7 -e±(r))' 


Now it follows that C1A2 —)• 0, as e ^ 0. In the case —1 < u < 0 we have e±{u) = —e, and 
since 0 < —u < I, applying the same inequality as above, we get 


|(ln(x)+ln(l + e))“ 


ln(x) “ — (ln(x) + ln(I + e)) “ 
^(x)"^ (ln(x) + ln(I + e))““ 
^ |ln(l + 6)|- 

~ ln(x)““ (ln(x) + ln(I + e))““ 


Now we take small h > 0 such that u — 5 > —I, for example, one can take 5 = l/2{u + 1). 
Keeping in mind that ln(I + e) > 0, we can estimate 

I ^ 1 

(ln(x) + ln(I + e))““ ~ (ln(x))‘^ (ln(I + 

Collecting the last two estimates we get 

/ OO 

^ r -( 7 -.±( r ))-2 

{ln{l + e)fT{l + u-5) 

(l-r(7-6±(r)))i+"-'' 


This allows to deduce that C1A2 —)■ 0, as e ^ 0. 

Let u > 1. By using inequality ||a|“ — |6|“| < u (|a|““^ — \a — b\, which holds for 

any real numbers a and b, we get 


/ OO 

^r('r-e±{r))-2 ln““^(x)dx 

2uT{u) |ln(I — e±(u))| 

(l-r-( 7-6±(r)))“ ’ 
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which implies ci A2 —>■ 0, e —)• 0. 

Thus, by applying the Khintchine weak law of large numbers, left-hand side of (j4.8p con¬ 
verges to zero in probability. In a similar way we can prove that the left-hand side of (|4.9I) 
tends to zero in probability, too. Theorem 11.11 is proved. □ 


Proof of Corollary \1.1[ From (11.81) it follows 

p 

{Gn{k, ri,ui), ..., Gn{k, re, ue)) (?(ri, ui),..., C(r£, ue)) , 
as n —>■ 00 . Applying Theorem 2.7 in [2] we derive (foil . □ 


Proof of the Theorem M.^ Let r = 0. Theorem 3.2.5 in [4j states 


Vk {Gn{k, 0,1) - 7) -^Af , 


n 


oc. 


(4.11) 


The relation (14.lip , together with \/fc 0,0) — 1) ^ 0 and Theorem 3.9 in [2], gives 

(I1.13P for r = 0. 

Consider now the case r / 0. Adjusting Potter’s type bounds (3.4) in [2D] for our purposes 
(such adjustment is needed since the second order condition (|l.lip and the corresponding 
condition in [20| are slightly different), we get that for possibly different function Ao(t) with 
Ao(t) ~ A(t), as t —>■ 00, and for each e > 0, 5 > 0 there exists a to such that for t > to, 


X >1, 


9r,o “ rx'^'’Ao{t)fp{x) 


< erx'^'^+P+^ \Ao{t)\, 


(4.12) 


where fp{x) is defined in ()1.3p . It is well-known that a similar Potter’s type bounds hold for 
the logarithmic function, see, e.g., inequalities (3.2.7) in Hj. Namely, for a possibly different 
function Ai{t) with Ai{t) ~ A{t), as t ^ 00, and for each e > 0, 5 > 0 there exists ti such 
that for t > ti, X > 1, 


In 



-7ln(x) - Ai{t)fp{x) 


< €xP+^ \Al{t)\ . 


(4.13) 


Let i = max{to,ti}- By multiplying inequalities (14.121) and (14.131) we get that for t > i, 
X > 1, 


- In (7^) + '^Mt)fp{x)} 
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+x^^{l + rAo{t)fp{x)}{-fln{x) +Ai{t)fp{x)} 
<e\x'^^+^P+^^\Ao{t)\\A,{t)\. 

Suppose that i is large enough that for t > t, x > 1, 


7ln(x) + Ai{t)fp{x) >0, 1 + rA(i{t)fp{x) > 0. 

Then, by applying inequalities (14.121) and (I4.13p one more time we obtain 

- edi{x,t) < gr,i ~ ^i(®) “ ci(x,t) < edi{x,t), (4.14) 

where 


6 i(x) = 7x'^’’ln(x), 

ci(x,t) = {Ai{t)+jrAo{t)ln{x))x'^'~fp{x) + rAo{t)Ai{t)x'^^fp{x), 
di{x,t) = x^^+P+^(^\A,{t)\{l + rAo{t)fp{x)} 

+r |Ao(t)| {7ln(x) + ^i(t)/p(x)} + erx^+'^ |^o(i)| \Ai{t )\^ , 


To prove two-dimensional Central Limit Theorem (11.131) we shall use the well-known 
Cramer-Wald method. Let {6o,0i) G R^. From (|4.12l) and (|4.14l) we get 

'U{tx) 


- ed{x, t) < eogr,o 




U{t) 


— b{x) — c{x,t) < ed{x,t), 


where 


(4.15) 


b{x) = Oox'^^ + 6ibi{x), 
c{x,t) = 9orAo{t)x'^'^fp{x) + 9ici{x,t), 
d{x,t) = \9o\rx'^''^P~^^\Ao{t)\+ \9i\di{x,t). 


We claim that 
fc-i 


1 


\/k 


i=0 


Y 

J- n .— 


n—i^n 


Yr 


n—k,n 


: ^n—k,n I 


r\9ofj.\ 


{1- p-5)\9ig\ 

dr{l) — p — 6 ' {dr{l) — p — sy 


+ 


From Lemma 1 in |20j we know that if u < 1, i' ^ 0, then 

, fc-i 




Similarly one can prove 


i=0 


k-l 

i=0 




En- 


n—i,n \ P 1 
>- 


n—k,n 


Yn—i,n \ P 


1 - Z/’ 


n ^ oo. 


Yn-k,nJ (l-l^)2’ 


n —>■ oo. 


n —oo. 


(4.16) 


(4.17) 


(4.18) 
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The relation 

'/kA{Yn-k,n)^fJ', n^oo, (4.19) 

where /i is the same as in (|1.12p. is proved in m- Now, by combining (j4.17p - (l4.19p one can 
obtain (I4.16p . 

Taking into account (14.161) . substituting the values of t and x from (14.41) into (I4.15P and 
performing summation over i = 0,1,..., fc — 1 we get distributional representation 

'/k {00 {Gn{k, r, 0) - ^(r, 0)) + Oi {Gn{k, r, 1) - ^(r, 1))} 

=VkBn{k,r) +'JkGnikjr) + Op{l), (4.20) 


where 


Bn{k,r) 

Gnik,r) 



By applying relations (|4.17D - (I4.19I) one more time, we find 

\/kCn{k, r)-^/r , re —>• oo. 


(4.21) 


where j = 1,2 are defined in (I1.14p . By using well-known Renyi’s representation 

(see e.g. Section 2 in [20] for details) we obtain 


VkBn{k,r)=Bn{k,r), 


(4.22) 


where 


Bn{k,r) 


k—\ 

i9-yr,o(yi) - ?(^ 0 )) + (^9-tr,l{Yi) 



Keeping in mind equality (|4.10l) one can deduce that quantity Bn{k,r) presents normalized 
sum of zero mean random variables, which are i.i.d.. Moreover, under assumption yr < 1/2, 

E|6lo(57r,o(ki) -^(^O)) +9i-f (^gyr,i{Yi) - | +‘^^oOiSi2{r) +elsl{r), 

where Si(r), s^r) and Si2(r’) are defined in (jl.lSD . Thus, applying Lindeberg-Levy central 
limit theorem we get the relation Bn{k,r)-^6QW^^'^ + 6iW^'^\ This, together with (I4.22p 
gives 

\fkBn{k, r)AeoW^^'> + re ^ oo. (4.23) 

Applying Theorem 3.9 in [2|, from (14.211) and (I4.23P we get 

^{Bn{k,r),Gn{k,r))^{eoW^^^ +^lW^^\^x(eov^^\r)+elU^^\r)y^ , n^oo. 
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Continuous Mapping Theorem gives us the relation 

Vk {Bn{k, r) + Cn{k, r)) A-Oo (r)^ + 9i (r)^ , n —)> oo. 

The last relation together with (l4.2Up gives (|1.13p . Theorem 11.21 is proved. □ 


Proof of the Theorem li.,'9t In the case j = 1 the proof of the relation (ll.lhh can be found in 
|20] (proof of the Corollary 1) or in [1] (proof of Theorem 2). But the asymptotic normality 
of all estimators ^A{k,r), j = 1,2,3 can be obtained in a unihed way, expressing these 
estimators as functions of statistics Gnik, r, 0) and Gn{k, r, 1), and then combining Theorem 
0 Theorem O and Continuous mapping Theorem. Namely, it is easy to see that 

(1 - yr) {Gnik,r,0) - ^^,0) 


7n^(^T) - 7 = 


rGn{k,r, 0) 


(2) 

For the estimator 7^ {k,r) we have 


ln\k,r) - 7 = 


2(1 - -fr)Gn{k,r, 1) - 7 - 'y-s/ArGn{k,r, 1) + 1 
2rGn{k, r, 1) + 1 + Y^4rGn(A:,r, 1) + 1 


Multiplying the numerator and denominator of the right hand side by 2(1 — 7 r)Gn(fc, r, 1) — 
7 + 7 V^ 4 rG„(A;,r, 1) + 1, we get 

lnHk,r) -7 

^ 4(1 - 7 r)^ {Gnjk, r, 1) - g^.i) {Gnjk, r, 1) + ^,-, 1 ) - 47 {Gnjk, r, 1) - g^.i) ^ 

2rGn{k, r, 1 ) + 1 + y/ArGn{k,r, 1) + 1 

1 


2(1 - 'yr)Gn{k, r, 1) - 7 + 7^4rGn(fc,r, 1) + 1 
For the third estimator the following representation holds: 


AnHk,r)-'y = 


r(l - jr) {Gnik, r, 1) - ^r,i) - {Gn{k, r, 0) - ^^,0) 


r2G„(A:,r, 1) 

As it was said, it remains to combine Theorem ll.il Theorem [L2] and Continuous Mapping 
Theorem, and we deduce pi. 161) with 1 < j < 3. For example, we have 

fnHk,r) - 7 = / (yk{Gn{k,r,l) - C{r,l)),Ak{Gn{k,r,0) - ^(r, 0)), G„(/c, r, 1)^ 


with 


f{x,y,z) = 


r{\ — 'yr)x — y 


r'^z 


From Theorems 11.11 and O and Theorem 3.9 from [2] we have 

/fc {Gnik, r, i) - f,{r, i)) ,i = 0,1, Gn{k, r, 1)^ A + yA\r),i = 1, 2, ^(r, 1)^ , 
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and now we apply Continuous Mapping Theorem (Theorem 2.7 in [2]). 


□ 


As it was mentioned at the end of Introduction, there is possibility to use general approach 
in proving asymptotic normality of the introduced estimators, suggested in [7j. Let 
stands for the empirical d.f. based on the sample Xi,..., and let the empirical tail 
quantile function is defined as 

Qn{t) := F-^ ^1 - = Xn-lk„t],n, t E [ 0 , 1 ]. 

Then almost all known estimators of the extreme value index that are based on some part 
of largest order statistics can be written as some functional T (defined on some functional 
space), applied to Qn- Then, having estimator written as T{Qn), the idea in [7] is to use 
invariance principle for the process Qn in the functional space on which T is defined and then 
requiring some smoothness (Hadamard differentiability in linear topological space) one can 
try to derive asymptotic normality of the estimator under consideration. For estimators, 
considered in the paper, it is possible to use this scheme, but the functionals are quite 
complicated. For example, one can write 7n {k,r) = TcMRiQn), but 


Tgmr{z) = f{To{z),Ti{z),T 2 {z)), 

f{x,y,v) = ——, for r 7^ 0 and f{x,y,v) = for r = 0, 
f y zy 



For this complicated functional we must prove Hadamard differentiability on some linear 
topological space (to choose the appropriate space is also non trivial problem, usual Z)[0,1] 
space with Skorokhod topology is not a linear topological space). Thus, it seems that our 
approach is much more simple and we do not need more restrictive conditions (such that 
appears in 0), since instead of invariance principle for tail quantile function we prove two- 
dimensional CLT for two particular statistics and then apply continuous mapping theorem 
in R^. 

Proof of the Theorem The expression of B^^^(r) is given in [T], and at first we followed 
the pattern of the proof in [T], but, as it was noticed in Introduction, there is more simple 
proof. 

From the expression (ll.lTp . given in the Introduction, we see that, in order to find 
Bn\k, r), it is sufficient to find the limit lim3;_j.oo ^n\k, r; Ai,..., A„_i, x), since ^n-i{k — 
1,r; Ai,..., A„_i)^7, as n — 5- oo and (|1.7I) holds. For all three estimators calculations 
are simple and similar, therefore we demonstrate the proof for the estimator ^n {k,r,x), 
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having the most complicated expression. It is clear that, for sufficiently large value of x, 
{k,r‘, Xi,..., Xn-i, x) can be written as 


2 {h{x) + h) 

2r {h{x) + 5) + 1 + y^4r {h{x) + 6) + 1 ’ 


(4.24) 


where h{x) = <7^,1 {x/Xn-k,n) and b is the sum of the rest summands from statistic Gn{k, r, 1) 
and does not depend on x. If r > 0, then h{x) —)• 00 and the limit of the quantity in (|4.24p 
is 1/r, while for r < 0 h{x) —0, and the limit in (14.241) is 


2b 

2rh + 1 + V 4r6 + 1 

and this expression almost (this word is used for the reason that in the above written ex¬ 
pression the sum is divided by k, while for complete coincidence division should be by fe — 1 ) 
coincides with — 1, r; Xi,..., Xn-i), therefore, passing to the limit as n —>■ 00 we get 

in limit = 0. In the case r = 0 only nominator contains function h{x) = g'0,1 {x/Xn-k,n) 
which grows unboundedly, therefore, we get infinite value for Bn\k,r). □ 
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